Mon. Not. R. Astron. Soc. 000, [THT2] (2010) Printed 6 March 2013 (MN M£X style file v2.2) 



Calibration of X-ray absorption in our Galaxy 



R. Willingale*, R.L.C. Starling, A. P. Beardmore, N.R. Tanvir and P.T. O'Brien 

C^") , University of Leicester, Department of Physics and Astronomy, University Road, LEI 7RH, UK 



o 

CN 



Oh 

6 

c3 



> 

oo 
q 

o 



X 



Accepted Received ; in original form 



ABSTRACT 

Prediction of the soft X-ray absorption along lines of sight through our Galaxy is 
crucial for understanding the spectra of extragalactic sources, but requires a good es- 
timate of the foreground column density of photoelectric absorbing species. Assuming 
uniform elemental abundances this reduces to having a good estimate of the total 
hydrogen column density, Njjtot = Nhi + 2iVjj 2 . The atomic component, Njji, is re- 
liably provided using the mapped 21 cm radio emission but estimating the molecular 
hydrogen column density, Nh 2 , expected for any particular direction, is difficult. The 
X-ray afterglows of GRBs are ideal sources to probe X-ray absorption in our Galaxy 
because they are extragalactic, numerous, bright, have simple spectra and occur ran- 
domly across the entire sky. We describe an empirical method, utilizing 493 afterglows 
detected by the Swift XRT, to determine Nntot through the Milky Way which provides 
an improved estimate of the X-ray absorption in our Galaxy and thereby leads to more 
reliable measurements of the intrinsic X-ray absorption and, potentially, other spectral 
parameters, for extragalactic X-ray sources. We derive a simple function, dependent 
on the product of the atomic hydrogen column density, Nhi, and dust extinction, 
E(B — V), which describes the variation of the molecular hydrogen column density, 
Nh 2 , of our Galaxy, over the sky. Using the resulting Nntot we show that the dust- 
to-hydrogen ratio is correlated with the carbon monoxide emission and use this ratio 
to estimate the fraction of material which forms interstellar dust grains. Our resulting 
recipe represents a significant revision in Galactic absorption compared to previous 
standard methods, particularly at low Galactic latitudes. 

1 INTRODUCTION v^i Z x az ,,x(l-fe,)xa Zi „ (3) 

Understanding the soft X-ray spectra of extragalactic 
sources requires an estimate of the foreground photo-electric 

absorption due to gas and dust in the Milky Way. The to- where the relative abundance of element Z with respect to 
tal photo-ionization cross-section which gives rise to X-ray hydrogen is Az — Nz /Nntot and the fraction of ions of this 
. absorption in the ISM can be written as the sum of contri- element in the i th ionization state is az,i = Nz,i/Nz and 
butions from 3 phases oz,i is the total photo-ionization cross-section of this ioniza- 

tion state. The fraction of ions in the gas phase is 1 — fiz,i so 
(Tism = Cgas + ^molecules + Cdust- (1) /3z,i is the fraction in other phases (usually dominated by the 

fraction in dust grains). Atoms deep within large grains are 
It is conventional to normalise the cross-section with respect shielded by the outer layers and therefore their contribution 
to the total hydrogen column density, Nntot cm -2 (molecu- to the total cross-section is reduced. Similarly, some atomic 
lar, neutral atomic and ionized), such that the X-ray spec- absorption edges are modified if the atoms are incorporated 
trum observed at energy E is into molecules and the molecular cross-section is not a sim- 

ple sum of the cross-sections of the constituent atoms/ions. 
1(E) = exp(— a j $ m(E)N Htot)Io(E) (2) If the ISM is warm then higher ionization states are excited 

and the az,% values will be modified. But all these factors 
where Io(E) is the source spectrum. Absorption in the soft only introduce small changes and what really dominates the 
band, 0.1-2 keV, is included in spectral fitting using a de- total cross-section are the abundances of the atomic types, 
tailed model of the cross-section, ctism, and the hydrogen Az- It is usually assumed that these abundances have fixed 
column density, Nntot, is usually a fitted parameter. The values across the Galaxy and so if we fix the value of Nntot 
cross-sections for each of the phases are obtained by sum- the column densities of all other atomic types can be de- 
nting the photo-ionization cross-sections of the individual termined and the expected cross-sections can be estimated, 
constituent atoms and ions weighting their c ontributions by Apportioning these atoms between atomic gas, partially ion- 
the ab undances. Following the notation in iRide fc WalQ ized gas, molecules and dust will normally introduce only 
j 19771 ) the cross-section for the gas phase can be written as minor perturbations to the total cross-section. Further de- 
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tails abo ut this mode l of X -ray absorption in the ISM are 
given by I Wilms et~ai1 (|200Ch . 

For extragalactic X-ray sources we can split the total 
measured absorption into two components including both 
the column density in our Galaxy, Nn, g , and the excess col- 
umn density in the host galaxy or elsewhere, Nh,i- If we 
know the redshift of the host then we can express the excess 
column in the rest frame of the host. However, if the spec- 
tral resolution is modest then individual absorption line fea- 
tures will not be visible and the energy profile of absorption 
is not expected to be a strong function of redshift meaning 
we cannot fit both components simultaneously. We must fix 
Nn.g if we want to extract a meaningful estimate of Nh,i- 
Therefore fitted values of the excess absorption for extra- 
galactic sources are critically dependent on our understand- 
ing of aisM and Nu,g for different lines of sight through our 
Galaxy. 

The dominant fraction of the total hydrogen column 
density is atomic , Nhi, (approximately 80% on average 
IWilms et al]|200d ) and this is readily estimated using 21 cm 
radio e mission maps (e.g. the Leiden- Argentine-Bonn (LAB) 
survey IKalberla et al.l i|2005h ). However, mapping the distri- 
bution of molecular hydrogen throughout our Galaxy is dif- 
ficult. The molecule has no permanent dipole which makes 
any radiative transitions weak and hard to detect. Surveys 
in the near-infr ared using the H2 line emission at 2.122 /jm 
(like UWISH2 , iFroebrich et all (|201ll )) readily detect dense 
(uh 2 > 10 3 cm -3 ) giant molecular clouds and supernova 
remnants but fail to detect diffuse emission from the low 
density interstellar medium (ISM) (uh ~ 1 cm -3 ) because 



of the surface brightness limit, 



10" 



W nT 



and the presence of numerous stars. Molecular hydrogen in 
the ISM is most readily observed using absorption lines from 
the first six rotational levels (J = 0-5) seen in the ultravi- 
olet between 900 and 1130 A. A sparse map of the column 
density, Nh 2 molecules cm -2 , across the Galaxy has been 
produced u sing FUSE UV spectroscopy on 73 extragalac- 
tic targets, |Wakkeri r2006), but with so few lines of sight it 
is difficult to construct a reliable and detailed picture. Fur- 
thermore all the targets considered were at high Galactic 
latitude with a maximum total hydrogen column density of 
~ 10 21 cm~ 2 so the distribution at high density in or near 
to the Galactic plane is not included. These data indicate 
that the molecular fraction of the column density along the 
line of sight, defined as 



f(H 2 



2N, 



H 2 



Nhi + 2N H2 ' 



(4) 



is highly variable at high Galactic latitudes and is not tightly 
correlated with the atomic gas column density, Nhi atoms 
cm -2 , or the total hydrogen column. 

The afterglow emission of Gamma Ray Bursts (GRBs) 
is seen at all wavelengths from X-ray through to radio 
and is thought to be synchrotron radiation arising from 
the shock produced w hen the GRB jet impac t s the sur- 
round ing medium (e.g. iMeszaros fc Reed i| 19921 ) ; Is ari et al.l 
(|!998h V The X-ray emission (0.3-10 keV) from GRB af- 
tergl ows is routinely me asured today by the Swift satel- 
lite |Gehrels et al.l |2004 ) and in almost all cases the soft 
X-ray spectrum is well modelled by a simple power law 
continuum modulated by photoelectric absorption at low 
energies. Absorption is expected from gas in the Milky 



Way along the line of sight but for most Swift GRBs 
there is also evidence for significant absorption in excess of 
the Galactic foreground. This is attributed to gas in the 
host galaxy and/or intergalactic clouds along the line of 
sight and ha s been s tudied in detail by ma n y authors, see 
iBehar et all j|201ll); ICampana et all il2012T) ; ISchadv et ail 
l|201ll ); IWatsonl ^Olll ); IZafar et alTl|201ll ) and references 
therein. Because the X-ray afterglows are bright and have 
simple continuum spectra they provide an excellent oppor- 
tunity to study the cold absorbing material in our Galaxy 
as well as the ISM in the distant host galaxies and this is 
what we turn attention to here. 



2 FITTING GRB X-RAY AFTERGLOW 
SPECTRA 

We selected X-ray afterglow spectra measured by the Swift 
X-ray Telescope (XRT) in photon counting (PC) mode from 
GRBs up until 2011 November 3. We did not use data 
obtained in Windowed Timing (WT) mode to avoid, as 
far as possible, early times post-burst when spectral evo- 
lution, probably associated with the prompt emission, is 
most likely to be occurring. For a few GRBs there is evi- 
dence for an additional, early, thermal continuum compo- 
nent which is probably associated with the prompt emis- 
sion or a supervova (SNe) rather than the afterglow. One 
such object for which extensive PC mode d ata were ob- 
tained is GRB 060218 (ICampana et al.l |2006[) and we ex - 
cluded this from our sample. GRB 090 618 (|Page et al.ll201ll ) , 
GRB 100316D (IStarling et al.l lioTll ) and GRB 101219B 
l|Starling et al.l I2012T I have similar early spectra but such 
complications are only seen in WT mode so these were in- 
cluded in our analysis. We fitted the GRB afterglow spec- 
tra with a simple model comprising a power-law continuum 
specified by two fitted parameters, a photon index and a 
normalisation, and two absorption components representing 
the Galactic absorbing column (with fixed column density) 
and the excess absorbing column for which the column den- 
sity was allowed to float. The Nn, g values used were take n 
from the LAB 21 cm all-sky survey, IKalberla et al. I (|2005h . 
using the FTOOLS NH procedure on the GRB positions. 
Fig. [T] shows the values for the fixed Galactic, Nn, g , and 
fitted excess Nn,i, column densities from spectral fits of 493 
afterglows. 

The fittin g was done using XSPEC version 12.7.0 
(lArnaudl Il996t) ut ilizing the absorption model tbabs Q, 
IWilms et al.l (|2000r) . This model includes elemental abun- 
dances tabulated by the same authors and these abundances 
were used for all the spectral fitting reported below except 
for those cases mentioned explicitly in the text. The Nh,x 
values plotted are the fitted values (excess absorption over 
and above the fixed Galactic value) assuming a redshift of 
zero (even for those bursts for which we know the redshift). 
We note that in this paper we are not concerned with the 
physical origin of the excess absorption in GRB afterglow 
spectra, and using the complete set, irrespective of whether 



1 The XSPEC command variants ztbabs and tbvarabs were also 
used extensively in the analysis to provide direct access to the 
abundance, dust grain and redshift parameters. 
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Figure 1. The results from fitting 493 individual GRB afterglow 
spectra. Top panel: The fitted values of excess hydrogen column 
Nff i (assuming z = 0) plotted against the fixed values of Njj g . 
Upper limits (3 sigma) for Njf j are shown as arrows. No corre- 
lation is expected between these two column densities but even 
allowing for the upper limits it is clear that Nu,i is systematically 
increasing with Nu,g- We note that a Galactic column density of 
Nf] g = 10 21 cm -2 corresponds to a average Galactic latitude 
of -13 degrees in the South and +22 degrees in the North. Some 
29% of the sky has N HiS > 10 21 cm" 2 . Bottom panel: The dis- 
tribution of fitted Nn,i- The smooth curve is a log-normal fit to 
the distribution. The dotted histogram shows the distribution of 
the upper limits. 



a redshift has been determined, provides a much larger sam- 
ple for the analysis and removes any risk of optical bias. 
The underlying GRB redshift distribution should, of course, 
be independent of position with respect to the Galaxy. The 
paucity of points in the bottom right of the plot is not imme- 
diately surprising because if the Galactic column is high the 
sensitivity to an excess column is reduced. We would expect 
the points from the lower-right (low Nh,i and high Nh, s ) to 
become upper limits in the upper-right of the distribution. 
In fact there are very few upper limits for objects with high 
Nh, 3 and for these values the fitted Nh,i are higher than av- 
erage. The distribution of fitted Nn,i values is shown in the 
bottom panel. The observed distribution is slightly skewed 
with respect to the log-normal fit and there appears to be an 
excess bump in the high tail. The distribution has a mean 
value of 2.1 x 10 21 cm -2 and a rms width of 0.48 dex. 

Although the X-ray afterglows of GRBs are exception- 
ally bright compared with other extragalactic sources, they 
are only visible over a relatively short time window and 
the spectra of some bursts have limited statistics, which of 
course is ultimately the reason for the upper limits on excess 



absorption for many GRBs. The distribution of upper limits 
is shown relative to the whole distribution in the lower panel 
of Fig. Q] They are clearly biased towards the lower tail of 
the Nh,i distribution, as expected, and we need some way 
of including these lower limits in our analysis if we are to 
understand what is going on. In order to investigate further 
we sorted the bursts into ascending order of Nh, 3 and se- 
lected GRBs in groups of 26, each group containing bursts 
that are expected to suffer a similar degree of Galactic ab- 
sorption. This gave 18 groups of 26 and a final 19th group of 
25. We summed together the count spectra in each group to 
produce 19 composite spectra and then fitted each compos- 
ite spectrum using XSPEC. Tools are provided along with 
XSPEC to sum the source spectra, background files, the 
area files (ARFs) and the response files (RMFs) to facilitate 
accurate fitting of composite spectra. The component spec- 
tra from the GRBs were extracted using grade 0-12 events 
from PC mode and the appropriate ARF and RMF files were 
automatically selected for the correct epoch. We didn't ap- 
ply any binning to the raw spectra because the composite 
spectra contained more than adequate counts. Pile-up was 
avoided by excising the core of the PSF when appropriate. 
The ARF files were summed weighting by the total counts 
in the spectrum. It was found that only two significantly 
different RMF files were required, before and after the sub- 
strate voltage change which was performed 2007-09-01. We 
summed these weighting by the accumulated time before and 
after the change over date. We checked this summing pro- 
cedure using different weightings and different combinations 
of ARFs and RMFs. Because each group contains 26 GRBs 
the results were remarkably insensitive to the weightings or 
components used. The average photon index of the power 
law continuum fitted for the individual GRBs was F = 1.96 
and the rms scatter was small, 0.18. We therefore expect 
the continua of the composite spectra to be well represented 
by a simple power law (with similar photon index) and we 
fitted the composite spectra using the same model as used 
for the individual GRBs. We fixed the redshift for the ex- 
cess column as z — and fixed the Nh, s to the mean for 
each group. Because the range of Nh, 3 within each group is 
narrow small changes in the fixed Nu,g values used make no 
difference to the results. The mean value of the excess ab- 
sorbing column density required increases with redshift but, 
crucially, the quality of the fit and the scatter of the excess 
absorption column densities remain the same. 

Fig. [2] shows the fitted Nha values for the 19 groups 
plotted against the fixed Nh, s values. Following the conven- 
tional approach, already used for the fits shown in Fig. [T] 
we used the Nh, 3 values taken from the LAB 21 cm all- 
sky survey. The error bars in Nh, 3 indicate the range of 
Galactic absorption within each group while those in Nn,i 
represent the 90% range for the fitted values. Crucially all 
the groups return a well defined Nh,% value with a 90% range 
and there are no upper limits. We expect there to be no cor- 
relation between the excess absorption in the GRBs and the 
column density within our Galaxy but the result of fitting 
the groups clearly indicates a strong trend. We can quantify 
the significance of the trend by calculating a reduced Chi- 
squared value against a constant model for Nn,i, xt ~ 64.4. 
In evaluating x 2 we use the formal fitting errors, although 
this will underestimate the true uncertainties given the ran- 
dom variation in the GRB afterglow properties from group 
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Figure 2. The fitted values of N H ,i, Z = 0, for 19 GRB groups 
plotted against the fixed values of Galactic hydrogen column 
Nb g- The solid horizontal line indicates the geometric mean and 
the dashed lines ±1 sigma in dex. 



to group. Assuming the width of the distribution of Nu,% 
values in Fig. [1] is representative of the GRB population, 
convolved with the measurement errors, we expect the rms 
between groups of 26 GRBs to be « 0.09 dex. Using this we 
can estimate the contribution to reduced Chi-squared from 
the scatter in Nh,i giving an expected value of reduced Chi- 
squared as < xi >= 17.3. The trend is obviously significant. 
Using linear regression we estimate that the total swing in 
the Nn,i values fitted across the full range of Nn, g values 
in the groups is a factor of 6.0. Either there is something 
wrong with the calibration of the data, the spectral fitting 
procedure is faulty or the model for the Galactic absorption 
is incorrect. 

There is a spread of GRB brightnesses within each 
group but if only one or two GRBs were to dominate this 
might bias the results. In fact the brightest GRB in each 
group contributes somewhere in the range 10-15% of the 
total count so a single GRB is not dominant in any group. 
However the distribution of brightnesses of the GRBs in each 
group is obviously important. It reduces the effective num- 
ber of GRBs contributing to the averaging and thereby in- 
creases the scatter we expect from the redshift dependence 
and intrinsic spread in Nu,i values. We return to this point 
later and show that the chosen grouping of the GRBs does 
not bias the results. 

We inspected all the group fits to check for anoma- 
lies. Fig. |3 shows the count rate spectrum and fitted pro- 
file for the 16 th group which is typical of the groups with 
Nh. 3 > 10 21 cm -2 . The solid curve is the best fit includ- 
ing the fixed Galactic column, Nh, 3 and the fitted excess 
column, Njj.i- The statistics are excellent and there are no 
features or large residuals to suggest a fault with the cali- 
bration or the fitting procedure. The dashed line is the pre- 
dicted spectrum with the excess absorption component set 
to zero, Nn,i = 0. The count rate residuals are shown in 
the lower panel. The excess absorption signature is clearly 
significant and covers a wide range of the low energy bins. 
For the GRB groups with Nu,g < 10 21 cm -2 the excess ab- 



sorption signature is much larger than the example shown 
in Fig. El 

The XRT calibration influences the fitting of the ab- 
sorption component in three ways. The first and dominant 
factor is the roll-off of the efficiency below ~ 1.0 keV. If the 
efficiency roll-off is too shallow then the fitted column den- 
sity must increase to match the detected count rate profile. 
Conversely, if the roll-off is too steep, the fitted column den- 
sity will be too low. The second factor is the broad efficiency 
profile and, in particular, the roll-off of the efficiency above 
~ 2 keV and the ratio of the collecting area at high ener- 
gies, above ~ 2 keV, to that at low energies, below ~ 1 keV. 
These factors determine the profile of the power law con- 
tinuum spectrum. If the high energy roll-off is too shallow 
then the fitted spectral index will be too large and the fitted 
column density must increase to compensate. Conversely, if 
the fitted index is too small then the fitted column density 
will be too small. The third factor is the redistribution ma- 
trix which dictates the detailed shape of the predicted count 
distribution. Any mismatch between the predicted and ob- 
served count profile degrades the goodness of fit and can 
lead to systematic errors in the fitted parameters including 
the column density. 

According to the latest XRT calibration release note 
(version 16) a systematic error of ~ 3% is required when fit- 
ting high statistical quality spectra using the current RMF 
and ARF response files. Including such a systematic error 
has no effect on the results plotted in Fig. The magnitude 
of the absorption fitted depends on the area calibration as 
a function of energy, in particular over the low energy range 
0.3-1.5 keV. If the roll-off of this area set by the calibration 
is incorrect, the absorption will be over- or under-estimated 
accordingly as discussed above. In order to determine the 
sensitivity of the fitted Nhj results reported throughout this 
work we repeated all the analysis with three versions of the 
Swift XRT calibration. Firstly the currently released ver- 
sion (SWIFT-XRT-CALDB-09_vl6) and secondly two trial 
versions generated by employing plausible enhancements of 
the known uncertainties in the hardware calibration: the de- 
tector QE, the filter transmission using updated absorption 
coefficients and measurements and mirror X-ray reflectivity 
incorporating a contaminating carbon overcoat. The trial 
calibrations comprise both new RMF and ARF files. A com- 
parison of the areas for these calibrations is shown in Fig. 
[U Significant differences are the depth of the oxygen K-edge 
at ~ 0.54 keV and the detailed structure across the silicon 
K-edge at ~ 1.8 keV and the gold M-edges 2.2 - 4.0 keV. 
The triall and trial2 curves in Fig. [4] have been scaled so 
that the differences in the roll-off at low energies is most 
obvious. For triall the roll-off below 1 keV is ~ 20% shal- 
lower than the current calibration and for trial2 this roll-off 
is ~ 20% steeper. The differences between the true calibra- 
tion and either version 16, triall or trial2 are very unlikely 
to be greater than the differences visible in Fig. 3] We note 
that the trial XRT calibrations are not intended as a bet- 
ter approximation to the true state of the instrument but 
they serve to illustrate that significant changes to the ef- 
fective area calibration do not alter our results. Using the 
triall XRT calibration increases the fitted Nn,i values by 
around 25% and the total swing in the Nh,i values fitted 
across the full range of NH, g values in the groups is a factor 
of 5.7, a little less than the swing using the current cali- 
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Figure 3. Top panel: The measured count spectrum for the 16 th 
composite group. The solid line indicates the best fit model. The 
dashed line shows the model prediction when Nh i = 0. Bottom 
panel: The count rate residuals. The solid curve and dots show 
the residuals when Nhj = 0. 
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Figure 4. Comparison of the effective area as a function of energy 
for the current (version 16) XRT calibration (solid curve), our 
triall calibration (dashed line), and our trial2 calibration (dot- 
dashed line). The triall and trial2 values have been scaled so all 
three curves have the same area at ~ 1.85 keV. 



bration (6.0). Using the trial2 decreases the fitted Nu,i by 
around 20% and increases the swing factor to 10.4. some- 
what larger than for the current calibration. In addition to 
these extreme versions of the calibration we also tried vary- 
ing the RMFs and ARFs individually through intermediate 
values. These changed the results in the same sense as triall 
or trial2 although the changes were always smaller. 

The primary influence of all the calibration factors on 
the fitted column density is a simple scaling because the in- 
strument response and the fitting procedure are, essentially, 
linear. If we change the roll-off in the effective area below 
~ 1 keV by 10% then we expect the fitted Nh to change 
by ~ 10%. This scaling is independent of the absolute value 
of Nh- However, the trend in the fitted column density val- 
ues which we are trying to account for is not the result of a 
simple scaling error. We conclude that uncertainties in the 
calibration cannot account for the observed trend seen in 
Fig. [2] and we believe that the XSPEC fitting procedure is 
well tried and tested. 

3 MODIFYING THE GALACTIC 

ABSORPTION MODEL PARAMETERS 

3.1 Global scaling 

The default implementation of the I Wilms et all l|2000l ) ISM 
model in the XSPEC tbabs model includes a molecular hy- 
drogen fraction, /(-H2) as defined in Equation [4] of 20%. 
It assumes that the Nntot fit parameter is the sum of 
the column densities (both expressed in terms of hydrogen 
atoms per unit area) of HI and H2. Therefore, employ- 
ing the widely adopted conventional approach and setting 
Nntot = Nhi from 21 cm surveys is strictly an incorrect use 
of this routine. The measured HI column should be multi- 
plied by 1.25 so that 20% represents the molecular column 
and 80% the HI fraction. Fig. [5] shows the distribution of 
fitted Nh,i if we adopt this value for the Galactic column in 
the group spectral fitting. The bursts in the last (19th) group 
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Figure 5. The Nn,i values for the 19 GRB groups using Nn,g 
equal to 1.25 X Nhi- The apparent discrepancy between the bin 
to bin scatter and the 90% error bars may not be very surprising, 
given that each group contains GRBs at a range of redshifts and 
true (intrinsic) columns. The horizontal lines are as in Fig. [2] 

have a very high mean foreground Nh, 3 ~ 10 22 cm~ 2 , and 
are typically at galactic latitudes < 2.0 degrees. Thus their 
lines of sight pass through complex regions of the disk, and 
it would be rather surprising if estimates of the foreground 
absorption were at all accurate. Nonetheless, in Fig. [5] the 
19th group is, within the large formal errors, consistent with 
the Nu,i value for the low foreground bins. The remaining 
groups still show the same unexplained trend as before. The 
reduced Chi-squared is now \t = 40.9 while the expected 
value is < xt > = 13.5. If the triall calibration described 
in Section [2] is used Fig. [5] is very similar except the mean 
value of Nh,i is increased by 25%. The trial2 calibration 
decreases the mean value by 18%. 

A simple scaling of the measured Nhi column density 
(or equivalently including a fixed percentage for molecular 
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Figure 6. The Njj,i values for the 19 GRB groups obtained using 
the abundances of Anders & Grevesse (1989). The horizontal lines 
are as in Fig. [2] 

hydrogen) is unable to explain the apparent variation in ex- 
cess GRB column density as a function of the absorbing 
column in our Galaxy. An alternative explanation for the 
apparent variation might be an error in the elemental abun- 
dances assumed. The lWilms et al] l)200(J ) model includes up- 
dated abundances which are ap proximately 70% of the pre - 
viously widely used values from I Anders fc Grevessej |l989). 
To test the e ffect of this change we re -ran the group fit- 
ting using the I Anders fc Grevessd (|l989T ) abundances for the 
Nh, 3 component. The result is shown in Fig. [5] The appar- 
ent variation remains and the Galactic absorption used for 
the top Nh,q group is now too large and results in an up- 
per limit which is lower than the fitted values for all the 
other groups. Using the trial calibrations to produce Fig. [6] 
makes little difference. Using triall the Nn,i value for the 
19 th group is still too low but somewhat higher and closer to 
the average value. If the trial2 calibration is used then the 
19* is near the average value but Nh,i for the lower Nn, g 
drop away in the same trend as Fig. [S] 

We also tried varying the global percentage of molec- 
ular hydrogen from to 30% and varying the elemen- 
tal abundances between values consistent with IWilms et al.l 
|200d ) and lAnders fc Grevessd l| 19891 ) but no combination 
improved the situation. We conclude that the problem is 
not resolved by a simple scaling of either the hydrogen den- 
sity or the elemental abundances independent of position 
on the sky. What is required is a modification of the ISM 
model as a function of direction across the Galaxy. In par- 
ticular the absorption associated with the mid-range values 
of Nn, g , around 10 21 cm -2 , needs to be increased while the 
lowest and the very highest absorption values should remain 
as they are. In all the subseq uent analysis we rev erted to us- 
ing the abundances given in IWilms et all (|2000l ). 

3.2 The column density of molecular hydrogen as 
a function of Nhi 

The fraction of hydr ogen in molecular form is known to be 
highly variable (e.g. iRachford et~alll2002l ). In order to get 




Figure 7. Upper panel: The fitted total hydrogen column densi- 
ties, Nntot, for the 19 group spectra plotted vs. the Nhi average 
for each group. The stars show the Nhi component (from the 
LAB survey) and the horizontal lines indicate the mean and rms 
range of the expected excess column density component, Nn,i, 
constant across all groups. The solid rising curve is the sum of 
LAB Nhi and Nn,i- Lower panel: The Nh 2 column density de- 
rived by subtracting the Galactic HI and excess HI contributions 
from the total. Those Nh 2 values plotted with downward pointing 
arrows are upper limits. The solid curve shows the simple model 
adopted for the molecular hydrogen column density as a function 
of the HI column density. 



a handle on the possible variation in the column density of 
molecular hydrogen we fitted the 19 group spectra with a 
single absorption component giving a total column density 
estimate, Nmot, for each group. This fitted total is expected 
to be the sum of the Galactic and excess components. Fig. 
[7] shows these single fitted values as a function of the av- 
erage LAB survey Nhi values for each group. We expect 
Nntot to be the sum of contributions from Galactic Nhi, 
excess Nhj and Galactic Nh 2 ■ The lower panel shows the 
estimate of Nh 2 obtained by subtracting the other two com- 
ponents from the total. To produce these values we assumed 
that the excess component has a mean of 9.4 x 10 20 cm -2 
(which is the average for the first 4 groups and very close to 
8.9 x 10 20 cm~ 2 given by the results in Fig. [SJ) and an rms 
value estimated from the width of the distribution from the 
individual fits shown in Fig. [1] For Nhi > 10 21 cm -2 the 
putative molecular hydrogen column is high and constant, 
independent of iV^ / , and below this it drops away although 
it is difficult to determine the functionality of the decline 
because the sensitivity is limited by the intrinsic scatter of 
the Nn,i values for the individual GRBs. Note that 6 of the 
groups below Nh, 3 = 5 x 10 20 cm -2 have Nh 2 values which 
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are upper limits. We repeated the analysis summarised in 
Fig. using the triall calibration described in Section [2] 
The Nntot values returned were 2.2 x 10 2 ° cm~ 2 higher, av- 
eraged across all groups. For groups with Nntot < 1.5 x 10 21 
cm -2 this corresponds to an average increase of 16% while 
for those with Nntot > 1-5 x 10 21 the average increase is 8%. 
We had to increase the assumed mean value of the excess 
component to 1.1 x 10 21 cm~ 2 , in line with the triall calibra- 
tion results returned when plotting Fig. [5J but the estimated 
Nh 2 values in the lower panel were the same within the lim- 
its imposed by the error ranges plotted. Using the trial2 
calibration shift things in the opposite direction with Nntot 
~ 18% down but otherwise the result is the same. This con- 
firms that the excess Galactic absorption, attributed to the 
molecular hydrogen column density in Fig. [7] is not an arti- 
fact of uncertainties in the calibration. It also demonstrates 
that the limits of uncertainties in the calibration introduce a 
systematic error of ~ 16% in the total fitted column density 
for low columns and ~ 8% when Nntot > 1-5 x 10 21 cm -2 . 

Given the results plotted in Fig. [7] we initially assumed 
that the molecular hydrogen column density throughout the 
Galaxy has the profile indicated by the solid line plotted in 
lower panel of Fig. [7] This has the form 



Nn 2 = N 



H 2 max 



exp 



N c 



(5) 



and we set the total Galactic hydrogen column density as 



N H , g = Nni + 2N„ 2 



(6) 



taking Nni from the LAB 21 cm survey. If Nni « N c then 
Nn 2 approximates to a simple power law with index a and 
if a = 1, Nn 2 is a simple fraction of Nni- If Nni >> N c 
then Nn, g asymptotes to Nn 2 max- So N c is a characteris- 
tic hydrogen column density where the powerlaw increase 
in Nn 2 flattens off to form a plateau with the maximum 
value Nn 2 max- The profile in the lower panel of Fig. [7] was 
plotted using Nn 2 max = 7.5 x 10 20 cm" 2 , N c = 1.5 x 10 21 
cm -2 and a — 2. We did not pursue any direct fitting of this 
profile using the fitted Nntot values because of the difficulty 
of handling the upper limits for Nn 2 that arise when Nni 
is low. Instead we performed a 3 dimensional grid search to 
find the best fit model which gave the least rms scatter for 
Nn.i across the 19 groups. At each grid point we fitted the 
19 group spectra with 2 absorbing components, Nn.i and 
a fixed Nn, a including Nn 2 specified by values of the pa- 
rameters Nn 2 max, N c and a. Fig. [5] shows the result with 

Nn 2 max = 7.5 X 10 20 '---1- — " 2 <«■ - OQ-7 „ m21 

atoms cm -2 and a 
groups is now significantly reduced to rms 0.128 dex closer to 
the expected value of ~ 0.09 dex. The reduced Chi-squared 
is x 2 = 33.1 while the expected value given the intrinsic scat- 
ter in Nn,i (rms 0.09 dex between the groups of 26 GRBs) 
is < x 2 >= 13.8. So some of the scatter in Fig.[8]most likely 
arises from errors in modelling the Galactic column density. 
The geometric mean value of Nn,i for the 19 groups in Fig. 
[8] is 8.8 x 10 20 cm -2 (z = 0.0) which is a significantly lower 
than the value for the distribution in Fig.[JJ( 2.1 x 10 21 cm -2 ) 
and very close to the mean in Fig. [5] and the value assumed 
to plot Fig.[7](8.9x 10 20 cm -2 ). This is understandable since 
our estimate of the Galactic contribution to the absorption 
has now increased and the upper limits present in the fitting 
of individual sources have been eliminated using the groups. 



J0 molecules cm -2 , N c = 2.37 x 10^ 
2. The scatter in Nn,i across the 




N u „ cm 

H.g 



Figure 8. The Nn,i values for the 19 GRB groups using Nn,g 
from Equation \E\ and using Nn 2 from Equation [5] The errors 
plotted for Nn,i are the 90% ranges. The horizontal lines are as 
in Fig. [2] 



The estimated mean and rms scatter values of the excess 
component are plotted in Fig.[7]as the horizontal lines. They 
largely encompass the fitted Nntot values for low Nni where 
the excess column dominates. Finally, we note that the fit- 
ted values of Nn 2 max and iV c are dependent on the Nni 
values used. Authors of the LAB survey and similar 21 cm 
surveys caution that saturation of the 21 cm emission may 
lead to inaccurate or under estimates of the atomic hydro- 
gen column density when Nni >> 10 21 cm~ 2 . Correlation 
of the LAB 21 cm map with extinction maps (E(B-V) de- 
rived from IR emission, see next section) indicates that any 
saturation is not very significant until Nni > 10 22 cm -2 . 
However, we acknowledge that some fraction of the column 
we are ascribing to H2 may actually be HI that has been 
lost due to such saturation. 



3.3 Including dust extinction in the ISM 
modelling 

So far we have not considered interstellar dust. Dust is rel- 
evant because it locks up some metals from the ISM, which 
are not therefore seen in gas phase abundance measure- 
ments. As noted previously, these metals still contribute to 
the soft X-ray absorption, albeit with some shielding due 
to the thickness of the grains. We assume that this absorp- 
tion is already correctly accounted for (see also Section [4} , 
but dust has an additional role in catalysing molecule for- 
mation, and we consider that process in this section. Most 
of the H2 in the ISM is probably created on the surface 
of dust grains. The production rate is dependant on the 
collision probability between H atoms and the grains and 
the amount of gaseous molecular material released will de- 
pend on the sticking probability of H2 on the grains. The 
competing dissociation rate depends on the temperature, 
cosmic ray and X-ray flux and collisions within interstellar 
shocks. For details the reader i s referred to the full discus- 
sion m IShull fc Beckwithl (|l982h and the references therein. 
For lines of sight out of the Galactic plane, with cold, low 
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Figure 9. The Nfj,i values for the 19 GRB groups using Nn,g 
from Equation \E\ with Nh 2 from Equation [jj The errors plotted 
for Nh i are the 90% ranges. The horizontal lines are as in Fig. 
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density material, we expect the Nh 2 column density to be 
proportional to the product NmNdust- As we move closer 
to the plane the density will increase although the ratio of 
Nh 2 /Nhi will be variable depending on local conditions. 
The profile derived from our initial analysis, shown in Fig. 
[7] indicates that there is an upper limit to the local Nh 2 of 
~ 10 21 molecules cm -2 , independent of Nhi- 

We modified the functionality of the Nh 2 profile so that 
it includes the extinction measure E( B — V), using t he all - 
sky maps of E(B - V) produced by ISchlegel et all (| 19981 ) 
from IRAS and COBE/DIRBE infra-red 100 /Ltm and 240 
/im data. These E(B — V) values are derived by correcting 
the measured IR emissivity using the measured temperature 
and assuming a constant reddening law so that E(B — V) 
is directly proportional to the dust column density, Ndust- 
In order to produce Nhi and E(B — V) values with the 
same limited angular resolution we rebinned the extinction 
maps into an all-sky Aitoff projection array with a pixel size 
of 0.75 degrees. This is identical to the array used by the 
FTOOLS NH procedure so we could use this to return both 
the LAB survey Nhi and E(B — V) values for every GRB 
position (or any other position). The molecular hydrogen 
column density profile was set as 



Nh 2 = N 



H 2 max 



exp 



-NmEjB - V) 
N c 



(7) 



Fitting the grouped spectra adopting this measure for Nh 2 
and, again, searching for values of the parameters Nn 2 max, 
N c and a which produced the minimum rms dex scatter 
gave the results shown in Fig. [5] The best fit parameters 
are N H2 max = (7.2 ± 0.3) x 10 20 molecules cm" 2 , iV c = 
(3.0 ± 0.3) x 10 20 atoms cm" 2 and a = 1.1 ± 0.1 where 
the errors quoted were estimated from changes in x 2 values 
calculated using the confidence ranges on Nh,i produced 
from the spectral fits (and plotted in Fig. |5J. We note that 
the parameters which gave the minimum rms also gave the 
minimum \ 2 ■ The rms scatter in the fitted Nhj values is 
now 0.087 dex, visibly less than for the distribution shown 



37 x 13 + 1 x 12 
25 x 19 + 1 x 18 
18 x 26 + 1 x 25 
17 x 29 



1.06 ±0.07 1.87 ±0.16 7.06 ± 2.38 

0.85 ±0.05 1.84 ±0.13 7.00 ±1.90 

0.84 ±0.02 1.83 ±0.12 6.85 ± 1.75 

0.82 ±0.03 1.82 ±0.12 6.90 ±1.78 



Table 1. The mean and rms scatter of fitted spectral parameter 
values using different groupings with Nn, g from Equation [6] and 
Nh 2 from Equation[jJ The groupings are given as #binsx#GRBs 
per bin. Nh ,% 10 21 cm -2 is the geometric mean of the excess col- 
umn density (at z=0). The photon index is T and the normalisa- 



tion is Norm 10 ph cm 



keV" 



in Fig.[8]and close to the value w 0.09 dex expected from the 
original distribution obtained from the individual spectral 
fits. The reduced Chi-squared is now \ 2 = H-l while the 
expected value is < xt > = U-7. Including the molecular 
hydrogen column density in the ISM model using the profile 
given by Equation [7] has eliminated the unexpected trend in 
the fitted Nn,i values seen in Figs. [2] and [5] 

In order to check that the results are not sensitive to 
the particular distribution of GRBs within the 19 groups 
used, we performed the analysis using different groupings. 
The mean and rms of the fitted parameters taken across the 
groups are given in Table [T] The mean photon indices and 
normalisations are reasonably constant. When the number 
of GRBs in each group is only 13 the scatter in the normal- 
isations and excess column density values is largest, as ex- 
pected. For the grouping plotted in Fig.[9l 18 x 26 + 1 x 25, 
the scatter in excess column density is significantly lower. 
Increasing the group size further to 29 makes little differ- 
ence although the scatter is now starting to increase. This 
is because some of the groups now span too large a range in 
Galactic absorption, a factor of 2 to 3, and using a single, 
fixed value for Nh, 3 is no longer a good approximation for 
the combined spectra. The results are clearly not sensitive 
to the particular GRB groupings used although the chosen 
grouping gives the best compromise between using too few 
GRBs per group and getting a large scatter in Nha and 
using too many GRBs per group and losing resolution in 
Nh, 3 . 

We repeated this analysis using the triall and trial2 
XRT calibrations described in Section [2] The best fit pa- 
rameter values were the same within the quoted errors. The 
rms scatter was larger, 0.090 dex using triall and 0.13 us- 
ing trial2. The only really significant difference was the ge- 
ometric mean of Nh,i across all the groups. In Fig. [9] this 



is 8.4 x 10 2 
1.05 x 10 21 
0.66 x 10 21 



cm 
cm' 



whereas using the triall calibration gave 
some 25% higher and using trial2 gave 
22% lower. 



4 COMPARISON OF THE Nh 2 DISTRIBUTION 
WITH OTHER MEASUREMENTS 

How does the molecular hydrogen column density profile 
inferred from GRB afterglow X-ray spectra compare with 
other measurements? Fig. [10] shows the molecular hydro- 
gen column measure d in UV absorpt ion to stars taken from 
Savage et all (|l977l ) (74 objects), iRachford etafl (|2002l . 
2009) (38 o bjects) and 66 extragalactic high latitude sources 
taken from lWakkerl l|2006l ). The molecular hydrogen column 
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Figure 10. The molecular hydrogen column as a function of 
NhiE(B — V). The Copernicus measurements from Savage et 
al. (1977) are plotted as blue circles (sources with only upper 
limits for Nh 2 are not included). The FUSE measurements from 
Rachford et al. (2002,2009) are plotted as green stars/open stars 
and the extragalactic FUSE measurements from Wakkcr (2006) 
are plotted as red squares. The solid line is the profile inferred 
from the X-ray GRB afterglow spectra. 



density is plotted as a function of NhiE(B — V) derived 
from the LAB survey and IRAS and COBE/DIRBE extinc- 
tion maps in the same way as described above for imple- 
menting the Nh 2 profile (Equation [7J . The profile derived 
from the X-ray absorption in GRB afterglows forms a rea- 
sonably well defined upper envelope to the distribution of 
Nh 2 points. The plateau for NhiE(B - V) > N c is well 
matched between the UV and X-ray confirming that Nh 2 
is independent of the atomic column density and the dust 
extinction in the Galactic plane. There is clearly a very large 
range of UV measured values below the upper envelope. The 
X-ray absorption measurements suggest that a large fraction 
of the molecular hydrogen is not seen in the UV absorption 
spectra. This may be because the absorption line features 
seen in the UV are produced by dense clouds of gas which 
have a well defined velocity with little dispersion. For many 
lines of sight through the galaxy a significant fraction of the 
molecular hydrogen may be distributed at much lower den- 
sity and with significant velocity dispersion. This material 
would produce very broad absorption features in the UV 
spectra or possibly a large number of small line features at 
different velocities which cannot be detected or identified. 
M ost of the FUSE observation s of bright sources reported 
bv lRachford et all (|2002l . I2009T ) are close to the X-ray pro- 
file indicating that when the signal to noise and sensitivity 
are good all the molecular hydrogen can be detected in UV 
absorption. It is possible that some of these bright FUSE 
sources are not subject to the entire Galactic column den- 
sity. This might explain why two sources, HD 164740 and 
HD 186994, have Nh 2 values significantly lower than pre- 
dicted by the model. 

The FUSE source which is closest to a GRB position 
is HD154368, 0.86 degrees from GRB 100504A. For this 
GRB the N H , g is 2.6 x 10 21 cm" 2 and the fitted NH,i is 
3.8 x 10 21 cm" 2 . This is consistent with the FUSE result of 



cm" 2 and the fitted N H ,i is 8.4 x 10 2 
with the FUSE value of Nh 2 = 3.80 x 10 20 cm" 2 . FUSE 
source HD206267 is 1.71 degrees from GRB 050422 and 
FUSE source HD179406 is 1.77 degrees from GRB 060105 
but these separations are now significant compared with the 
resolution and variations in the Nhi and E(B — V) maps. 
All other FUSE sources are several degrees or more from the 
nearest GRB and any stacking analysis to try and make a 
more detailed comparison is not possible. 

Emission from the lower frequency rotational transi- 
tions of carbon monoxide (CO) molecules in the radio band 
(the J=l — > line is at 115 GHz) is used as a proxy for 
H2 in stud ies of Giant Molecular Clo uds, (see e.g. iDickmanl 
l|l978h and lGlover fc Mac Low! l^OHl ) 1 ). A simple conversion 
factor X C o = Nh 2 /W C o ~ 2 x 10 20 cm" 2 K" 1 km" 1 s, 
where Wco K km s~ is the intensity of the CO emission 
line (J=l -> 0) integrated over velocity, is used to predict 
the Nh 2 column density of a cloud from the observed Wco- 
The density of the absorbing column can be estimated inde- 
pendently by various means: from the diffuse 7-ray flux pro- 
duced by interactions with cosmic rays, from the measured 
extinction or from the atomic hydrogen 21 cm emission, but 
the presence of H2 is inferred and not observed directly. 

We attempted t o use such a scalin g by rebinning the 
CO survey data from iDame et all (|200ll rl to produce a CO 
map in the same Aitoff projection used by the FTOOLS NH 
procedure (as we did for the E(B — V) mapping described 
above). We extracted Wco values for every GRB position 
and used these to set Nn, g = Nhi + IXcoWco in the fit- 
ting of the group spectra. The extended CO survey covers 
~ 60% of the sky so we assumed that Wco = for GRBs 
which fell outside this area. This is reasonable because those 
sky areas not covered by the survey are known to be regions 
of very low CO emission. We were unable to find a scal- 
ing factor Xco which reduced the rms scatter across the 
19 groups. In retrospect this is not surprising because the 
profile of the molecular fraction (/(f/2) defined by Equation 
[4| predicted using the CO maps is very scattered and not 
peaked around Nutot — 2 x 10 21 cm" 2 as required. In fact 
there is no correlation between Nh 2 given by Equation[7]and 
Wco- CO emission may be a proxy for molecular hydrogen 
in dense molecular clouds but this does not seem to extend 
to the ISM distribution across the Galaxy. We searched for 
other potential correlations between Wco and the dust or 
hydrogen column densities. We found that by far the best 
correlation was with the dust-to-hydrogen ratio defined by 



Rdh 



E(B -V)x 10 2 
Nhi + 2Nh 2 



(8) 



A 



H 2 



1.45 x 10 21 cm" 2 . The next closest is HD40893, 1.64 



The factor of 10 is included so that the ratio has a value 
~ 1 out of the Galactic plane and Equation [7] was used to 
calculate Nh 2 ■ This ratio is plotted as a function of Wco in 
Fig. 1111 For the bottom panel we calculated the average ratio 
values in 20 bins across the full Wco range. These average 
values increase monotonically with Wco (within the noise) 
and are well fitted by the function 



2 Thomas Dame kindly provided us with the mid-latitude exten- 
sion of the CfA CO survey prior to publication and this gave us 
better coverage out of the Galactic Plane. 
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Figure 11. The dust-to-hydrogen ratio, Rdh, as a function of 
CO emission. Top panel: Each point represents a pixel fa 0.6 
deg 2 . Wqo values are available and plotted for ~ 60% of the sky. 
Bottom panel: Average Rdh values calculated for 20 logarithmic 
bins across the full range of Wco- The dashed lines indicate the 
90% range of the scatter in Rdh- Error bars are plotted for the 
averages but these are too small to see except for the bins at the 
top end. The solid line is the model fit described in the text. 

Rdh = 1 + A x W^o (9) 

where A = 0.51 ± 0.04 and 7 = 0.55 ± 0.03. This function 
fit is plotted in Fig. 1111 The correlation is still present if the 
molecular hydrogen component is ignored but the scatter is 
significantly reduced when the Nh 2 term is included. 

The dust-to-hydrogen ratio is important in the context 
of the X-ray absorption model of the Galaxy. As defined by 
Equation [5] the ratio changes from 1.0 out of the Galactic 
plane to a peak of ~ 20 in the Galactic plane. This increase 
could be due to a change in metalicity and imply a gradient 
of the abundances of Az in Equation [3] or it could reflect a 
change in the fraction of the higher metallicity material con- 
densed into dust thereby representing a change in the fiz,i 
values in the same Equation. We have tested both these 
possibilities by introducing, in turn, an abundance gradient 
and a gas depletion gradient into the absorption model used 
for the spectral fitting of the GRB groups. We only var- 
ied these factors for the elements expected to be incorpo- 
rated into the dust (C,0,Na,Mg,Al,Si,S,Cl,Ca,Cr,Fe,Co,Ni 
for which the gas depletion can be set in the tbvarabs rou- 
tine in XSPEC). The group spectral fits are sensitive to the 
abundances: a change of just 20% in abundance over the 
full hydrogen co lumn density range (i.e. ±10% about the 
average given bv lWilms et al.l l|2000l )) is sufficient to signif- 
icantly degrade the quality of the best fit model shown in 



Fig. [9] The opposite is true for the gas depletion factors. 
If they are set to the extreme value of 1.0 (no dust) then 
the quality of the fit degrades but the changes are barely 
statistically significant. Using a value of 0.0 (all dust) intro- 
duces an even smaller degradation. This is because changing 
the gas depletion does not alter the amount of material in 
the column but simply changes the shielding effect of the 
dust grains and this effect is small. We conclude that the 
change in Rdh must be dominated by gas depletion and not 
an abundance gradient although the X-ray absorption mea- 
surements from the GRB afterglows are unable to confirm 
the range in gas depletion implied by Rdh shown in Fig. 1111 
As already demonstrated, the spectral fitting of the grouped 
GRB afterglow spectra serves as a good calibration of the 
elemental abundances assumed. A study of Galactic absorp- 
tion usi n g the afterglow spectra from GRBs, described by 
IWatsonl (|201ll ). concludes that the metalicity of a typical 
Galactic line of sight i s not consistent with the abundances 
given by I Wilms et all l|2000l ). However, if the molecular hy- 
drogen column density is included in the absorption model, 
we have shown that these abundances are correct to within 
~ ±10% and there is no evidence for a large abundance 
gradient as a function of hydrogen column density. 

In general as Wco increases so the dust extinction, 
E(B — V), and hydrogen column density, Nntot, increase, 
but the dust-to-hydrogen ratio given by Equation [9] leads 
to a rather counter- intuitive result: if Wco > 10 K km s _1 
then Rdh — 0.5Wco 5 ■ K we measure the dust extinction 
and CO emission then we can estimate the total hydrogen 
column density as N H tot ~ 2 x 10 22 E(B - V)W^ 55 cm -2 . 

The well known correlation between the dust column 
density (given by E(B - V) or A v = RvE(B - V)) and the 
hydrogen column (given by Nhi) is better represented using 
NHtot, including the molecular hydrogen component. The 
Av — NHtot relation has a fractional rms scatter of ~ 26%, 
less than the Av — Nhi relation for which the fractional rms 
scatter is ~ 33%. Because the dust-to-hydrogen ratio, Rdh, 
increases with Nntot the relationship is not linear. I f E(B — 
V) < 0.1, and adopting R v = 3.1 from IWatsonl (|201lf ). 
then we find the linear relation Nntot = 3.2 x 10 21 cm -2 
Av but for E(B — V) > 0.1 the correlation flattens off as 
Rdh increases. If we ignore the molecular hydrogen we find 
Nhi = 2.0 x 10 21 cm -2 A^? 86 a result very sim ilar to the non- 
linear relationship given bv IWatsonl llioil. However, this 
non-linearity is introduced because of the missing molecular 
hydrogen component and the change in Rdh and is not a 
consequence of a gradient in metalicity. 



5 THE MOLECULAR HYDROGEN FRACTION 

We can calculate the molecular hydrogen fraction using all 
the pixels in the FTOOLS version of the LAB survey all-sky 
image and the rebin ned version of the E(B-V) data from 
SchlcEcl et al.l (|l99Sf ). The result is plotted as a function of 
the total hydrogen column density in Fig. 1121 Each point rep- 
resents an estimate of molecular hydrogen fraction for a sky 
area of ~ 0.6 deg 2 . The scatter is introduced because there 
is considerable scatter in the correlation between Nhi and 
E(B — V). At high column densities the molecular hydrogen 
fraction actually decreases and the scatter gradually disap- 
pears because the Nh 2 column density asymptotes to the 
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Figure 12. The molecular hydrogen fraction plotted as a function 
of the total hydrogen column density calculated for pixels across 
the whole sky. The horizontal line represents the 20% level which 
is the default value in the XSPEC tbabs routine. 



constant value, Nn 2 max- Conversely, for low total hydrogen 
column densities the molecular hydrogen density is increas- 
ing faster than the total column density and we get a peak 
in the /(-H2) profile. The presence of the peak in the f(H%) 
distribution is why the introduction of this model is able to 
improve the fitting of the GRB afterglow group spectra. It 
has the effect of significantly increasing the absorption at 
and around Nhi = 10 21 cm -2 while having relatively little 
impact on the absorption at high and low Nhi values. The 
model using the product NhiE(B-V) rather than just Nhi 
alone offers an improvement because the correlation between 
Nhi and E(B — V) is not tight and the molecular hydrogen 
column depends on both Nhi and E(B — V). The constant 
value of 20% used in the XSPEC tbabs routines is shown 
in Fig. [T^] as the horizontal dashed line whereas we find the 
maximum value of Nh 2 is ~ 66%. Remarkably the mean 
value of /(-H2) produced by the current model over the sky 
is 20. 2%, very close to the value adopted by IWilms et al.l 
|200d ). 

Why does the molecular hydrogen fraction peak at 
Nntot ~ 2 x 10 -21 cm -2 rather than continuing to rise as 
you might expect and why is the Nh 2 column density not 
correlated with the CO emission? We don't know, but we 
comment that the formation and destruction of molecular 
species is dependent on many factors. Theoretical and obser- 
vational studies of molecular clouds indicate that the forma- 
tion of H2 is primarily determined by the time available for 
its formation and the Xco factor relating the presence of H2 
to the CO emission can be highly variable if the metalicity 
and d ensity are low and the back g round UV radiation field is 
high jGlover fc Mac Low! |201ll ); IShettv et all (|201lh V The 
Nh 2 column density model for the Galaxy presented here re- 
flects the average state of the ISM of the Galaxy rather than 
conditions in molecular clouds taken individually. There is 
a large scatter in both the molecular hydrogen fraction and 
the dust-to-hydrogen ratio which must reflect a broad spec- 
trum of local conditions. 



6 CONCLUSION 

Measurements of X-ray absorption in the spectra of GRB 
afterglows indicate that using the distribution of HI as the 
only direction dependent variable in the model of the Galac- 
tic ISM is inadequate to describe variations in the column 
density seen. The discrepancy can be explained by includ- 
ing a molecular hydrogen column density component which 
is a function of the product of the HI column density and 
dust extinction, NhiE(B — V), given by Equation [7] The 
distribution of molecular hydrogen inferred from the X-ray 
absorption measurements is in agreement with direct mea- 
surements of the Nh 2 column density made using UV ab- 
sorption spectra. Summing the column densities of atomic 
and molecular hydrogen we can estimate the total hydro- 
gen column density and calculate a dust-to-hydrogen ratio, 
Rdh- This ratio is shown to correlate with the CO emis- 
sion following the function given by Equation[9] In principle 
Rdh can be used to estimate the gas depletion factors in 
the X-ray absorption model although the present X-ray ab- 
sorption measurements are not sensitive enough to warrant 
the inclusion of this direction dependence. 

If the total effective hydrogen column density (at z = 0) 
is less than ~ 1.5 x 10 21 cm -2 then the systematic error 
on the fitted value, imposed by current uncertainties in the 
area calibration of the Swift XRT, is ~ 16%. For higher col- 
umn densities this systematic error drops to ~ 8%. Further- 
more, we can confi rm that the e l emen tal abundances in the 
Galaxy assumed in lWilms et al.l (|2000r ) are correct to within 
~ ±10% although because of the limited spectral resolution 
of the CCD detector the XRT is not sensitive to the details 
of the ratios between the individual elements. 

If you required an accurate estimate of the X-ray ab- 
sorbing column density in our Galaxy you should include an 
estimate of the molecular hydrogen column densitj0. When 
using the tbabs in XSPEC you should, at the very least, mul- 
tiply the atomic hydrogen column density estimated from 
21 cm survey data, Nhi, by a factor of 1.25 so that 20% of 
the total hydrogen column is molecular. You can improve 
the accuracy by employing Equation [SI calculating the total 
column using Equation [6] and setting the molecular fraction 
appropriately using the tbvarabs model. If you have a Galac- 
tic E(B — V) extinction measure for the source position you 
can further improve the accuracy using the Nh 2 profile given 
in Equation [7] 
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3 Software to calculate Nh 2 and Nntot f° r a given position is 
available at http://www.star.le.ac.uk/-rw/xabs 
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